Picture for Chengsong Huang

Chengsong Huang

Training Data Efficiency in Multimodal Process Reward Models

Add code
Feb 05, 2026
Viaarxiv icon

Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing

Add code
Feb 03, 2026
Viaarxiv icon

Rethinking the Reranker: Boundary-Aware Evidence Selection for Robust Retrieval-Augmented Generation

Add code
Feb 03, 2026
Viaarxiv icon

TTCS: Test-Time Curriculum Synthesis for Self-Evolving

Add code
Jan 30, 2026
Viaarxiv icon

MoCo: A One-Stop Shop for Model Collaboration Research

Add code
Jan 29, 2026
Viaarxiv icon

RelayLLM: Efficient Reasoning via Collaborative Decoding

Add code
Jan 08, 2026
Viaarxiv icon

Benchmark^2: Systematic Evaluation of LLM Benchmarks

Add code
Jan 07, 2026
Viaarxiv icon

UniRel-R1: RL-tuned LLM Reasoning for Knowledge Graph Relational Question Answering

Add code
Dec 18, 2025
Viaarxiv icon

VisPlay: Self-Evolving Vision-Language Models from Images

Add code
Nov 19, 2025
Viaarxiv icon

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Add code
Sep 09, 2025
Viaarxiv icon